Towards Zero-Shot Autonomous Inter-Task Mapping through Object-Oriented Task Description
نویسندگان
چکیده
The successful use of Reinforcement Learning in complex tasks depends on techniques to scale-up classical learning algorithms because they suffer from the curse of dimensionality. Transfer Learning approaches have been used to accelerate learning by reusing knowledge gathered from the solution of previous tasks. However, discovering how different tasks are related is a very complex undertaking if a human is not available (or is unable) to manually establish a mapping between tasks. We here propose an algorithm to autonomously estimate a Probabilistic Inter-TAsk Mapping (PITAM) across tasks described in an object-oriented manner, which requires less domain knowledge than a handcrafted Inter-Task Mapping. We also propose two strategies for Temporal-Difference algorithms to transfer knowledge using learned PITAMs. Our experiments evaluate varied scenarios in which the source and target tasks differ in several aspects, and our proposal presents benefits over both regular learning and Q-value Reuse using a detailed InterTask Mapping.
منابع مشابه
Supplementary Material: Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning
Inter/Extrapolation. In this experiment, a task is defined by three parameters: action, object, and number. The agent should repeat the same subtask for a given number of times. The agent is trained on all configurations of actions and target objects. However, only a subset of numbers is used during training. In order to interpolate and extrapolate, we define analogies based on simple arithmeti...
متن کاملUsing Task Descriptions in Lifelong Machine Learning for Improved Performance and Zero-Shot Transfer
Knowledge transfer between tasks can improve the performance of learned models, but requires an accurate estimate of the inter-task relationships to identify the relevant knowledge to transfer. These inter-task relationships are typically estimated based on training data for each task, which is inefficient in lifelong learning settings where the goal is to learn each consecutive task rapidly fr...
متن کاملAudience awareness of Persian learners of English writing: Towards a model of task-oriented strategies
Persian learners of English often avoid attending to audience considerations, which brings them lower scores. The present study was conducted in a major university in Iran to help Persian learners develop a sense of audience awareness in writing. Thirty five Persian students of English were trained with a focus on process-oriented instruction. The intended task was a...
متن کاملUsing Task Features for Zero-Shot Knowledge Transfer in Lifelong Learning
Knowledge transfer between tasks can improve the performance of learned models, but requires an accurate estimate of the inter-task relationships to identify the relevant knowledge to transfer. These inter-task relationships are typically estimated based on training data for each task, which is inefficient in lifelong learning settings where the goal is to learn each consecutive task rapidly fr...
متن کاملAutonomous Attentive Exploration in Search and Rescue Scenarios
In task-oriented exploration a robot has to direct its sight and delving towards the most promising regions of the environment, according to the task, in order to optimize its search. If the goal is dynamically set on the basis of what it is perceived, attention plays a crucial role, as it allows to combine fast glancing with accurate analysis, enabling the robot to quickly jump to conclusion b...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017